PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Csa07g003790.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Camelina
Family HD-ZIP
Protein Properties Length: 830aa    MW: 90231.8 Da    PI: 6.808
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Csa07g003790.1genomeCSGPView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox62.27.9e-20134189156
                     TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
        Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                     +++ +++t++q++ Le++F+++ +p++++r +L+++l+L+ rqVk+WFqNrR+++k
  Csa07g003790.1 134 KKRYHRHTPKQIQDLESVFKECAHPDEKQRLDLSRRLNLDPRQVKFWFQNRRTQMK 189
                     688999***********************************************999 PP

2START186.91e-583395582206
                     HHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEEEEE CS
           START   2 laeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv.....dsgealrasgvvdmvlallveellddkeqWdetla....kaetle 83 
                     la+ a++elvk+a+ ++p+Wv+ss    + +n++e+ ++f++  +     + +ea+++sg v+ ++  lve+l+d+  +W e+++    + +t+e
  Csa07g003790.1 339 LALGAMDELVKMAQTRDPLWVRSSdtgyDVLNQEEYDTSFSRCVGpkpdgFVSEASKESGTVIINSLALVETLMDSE-RWAEMFPsmisRTSTTE 432
                     67889*******************999966666666666655333677999**************************.*******9999****** PP

                     EECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEEEEE-EE CS
           START  84 vissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskvtwvehvd 171
                      is+g      gal+lm+ae+q+lsplvp R + f+R+++q+ +g+w++vdvS+ds ++ + sss+ R   lpSg+l+++++ng+skvtw+eh++
  Csa07g003790.1 433 IISNGmggtrnGALHLMQAEFQLLSPLVPvRQVSFLRFCKQHAEGVWAVVDVSIDSIREGS-SSSCRR---LPSGCLVQDMANGYSKVTWIEHTE 523
                     ************************************************************9.777766...************************ PP

                     --SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
           START 172 lkgrlphwllrslvksglaegaktwvatlqrqcek 206
                     ++g+ +h l+r+l++ gla+ga +w+a+lqrqce+
  Csa07g003790.1 524 YDGNRIHRLYRPLLSCGLAFGAHRWMAALQRQCEC 558
                     *********************************96 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466891.96E-19122191IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.604.0E-20125191IPR009057Homeodomain-like
PROSITE profilePS5007117.07131191IPR001356Homeobox domain
SMARTSM003891.9E-18132195IPR001356Homeobox domain
PfamPF000461.8E-17134189IPR001356Homeobox domain
CDDcd000861.20E-17134191No hitNo description
PROSITE patternPS000270166189IPR017970Homeobox, conserved site
PROSITE profilePS5084840.236329561IPR002913START domain
CDDcd088752.33E-110333557No hitNo description
SuperFamilySSF559616.87E-33333558No hitNo description
SMARTSM002343.9E-47338558IPR002913START domain
PfamPF018521.1E-50340558IPR002913START domain
SuperFamilySSF559613.3E-16587821No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0048497Biological Processmaintenance of floral organ identity
GO:0008289Molecular Functionlipid binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 830 aa     Download sequence    Send to blast
MNFNGFLDNT SSGVDGAGAS KLLSDVPYNN NHHFSFSAVD TMLGTTAAIT PSLHSRTPNS  60
RPFSSHGLSL GLTNGEMSRN GEVLLESNVT RKKKTSRGGG EDVTTESRSE SDNAEAVSGD  120
DLDTSDDRPP FKKKKRYHRH TPKQIQDLES VFKECAHPDE KQRLDLSRRL NLDPRQVKFW  180
FQNRRTQMKT QIERHENALL RQENDKLRAE NMSVREAMRN PMCGNCGGPA VLADISMEEQ  240
HLRIENSRLK DELDRVCALT GKFLGRSNGS HYIPDSALVL GVGVGSAGCN GGGGAGGGFT  300
LSSPRFEISG TGSGLATVNH HQPSVSVSDF DHRSRYLDLA LGAMDELVKM AQTRDPLWVR  360
SSDTGYDVLN QEEYDTSFSR CVGPKPDGFV SEASKESGTV IINSLALVET LMDSERWAEM  420
FPSMISRTST TEIISNGMGG TRNGALHLMQ AEFQLLSPLV PVRQVSFLRF CKQHAEGVWA  480
VVDVSIDSIR EGSSSSCRRL PSGCLVQDMA NGYSKVTWIE HTEYDGNRIH RLYRPLLSCG  540
LAFGAHRWMA ALQRQCECLT ILMSSTVSPS PSRTPINCNG RKSMLKLAKR MTDNFCGGVC  600
ASSLQKWSKL NVGNVDEDVR IMTRKSVNIP GEPPGIVLNA ATSVWMPVSP RRLFDFLGNE  660
VLRSEWDILS NGGPMKEMAH IAKGHDHSNS VSLLRASAVN ANQSSMVILQ ETSIDAAGAV  720
VVYAPVDILA MQAVMNGGDS AYVALLPSGF AILPNAQTGT QRCTTEEPNG SGSGEMCMEE  780
GGSLLTVAFQ ILVNSLPTAK LTVESVETVN NLISCTVQKI KAALHCDST*
Binding Motif ? help Back to Top
Motif ID Method Source Motif file
MP00418DAPTransfer from AT3G61150Download
Motif logo
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieveRetrieve
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAY0508660.0AY050866.1 Arabidopsis thaliana putative homeobox protein (At3g61150) mRNA, complete cds.
GenBankAY0967570.0AY096757.1 Arabidopsis thaliana putative homeobox protein (At3g61150) mRNA, complete cds.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_010413453.10.0PREDICTED: homeobox-leucine zipper protein HDG1-like isoform X3
SwissprotQ9M2E80.0HDG1_ARATH; Homeobox-leucine zipper protein HDG1
TrEMBLR0FNB90.0R0FNB9_9BRAS; Uncharacterized protein
STRINGfgenesh1_pm.C_scaffold_50020900.0(Arabidopsis lyrata)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM112827105
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G61150.10.0homeodomain GLABROUS 1